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DNA expression In i ransfected Ceils And 
/' ^ Assays Carried Out In Transfected Cells 

v' 

/' 

This invention relates to methods of expressing DNA in cells, to vectors for 
expression of DNA in cells and to transfected cells. The invention also relates to 
assays carried out in transfected cells cr dffferentiated derivatives of such cells. In 
particular the invention relates to transfecticn of and expression of DNA in embryonic 
stem (ES) ceils. 

The wealth of sequence information now becoming available from the aenome 
projects demands the development of new, high throughout systems for functional 
analysis. A powerful route to discovering and characterising denes involved in 
determination and differentiation in mammals is potentially available via the genetic 
manipulation of ES ceils in vitro, 

ES ceils, which are derived from the pluripoteniiai inner cell mass (1CM) of the 
□reimplantation mouse embryo (2,3), retain the capacity for muitiiineage differentiation 
ccth in vitro (4,5) and in vivo (5,7). In principle, therefore, gene products which 
influence developmental decisions should be assayable in ES cell culture systems, 
'whatever the source of the cells. However, there are major difficulties in analysing 
cDNA function by ES ceil transfecticn. The frequency of isolating stable transfectants 
is iow (<1CT by electroporation, caicium phosphate co-precipitation or iipofection) and 
tne great majority of transfectants show heterogeneous and unstable expression. 

i hese problems are particularly significant in the case of cDNAs whose expression 
causes differentiation because differentiated ES cell progeny do not generally 
pro iire rate. In such cases transfectants may still be isolated but transgene expression 
will be minimal. 

roisoma! vectors have been used for functional screening in other cell types in order 
to increase the frequency of stable transfecticn and to achieve reliable transgene 
excression. However, previously described episomal vectors, for example based on 
rostein-3arr virus (E3V) cr bovine papilloma virus (3PV), have limitations both in 
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nest ceii range and maintenance curing long-term culture. 

A modified extrachromosomal vector is known based on the replication system of 
murine polyoma virus (8). This piasmid, pMGD20neo, can be stably maintained as 
an episome in ES cells during long term culture. Importantly, the low levels of large 
T protein produced have no overt effect on the growth or differentiation properties of 
tne ES cells (8,9). It is also known to maintain simultaneously with pMGD20neo a 
second episomal vector Expression from the second vector was not possible hence 
pMGD20neo was used for cDNA expression. However, this vector already comprises 
two expression cassettes, one each for large T antigen and the neo selectable marker 
so its size constrains its use for expression of a third cassette containing a cDNA. 

It is an object of the invention to provide a vector for transfection of and expression 
of DNA within a cell and a method of expressing DNA in a cell that overcomes or at 
least ameliorates the disadvantages identified in the art. An object of at least the 
preferred embodiments of the invention is to achieve, in a transfected cell, expression 
that is more stable and more homogenous than hitherto attainable. Further objects 
of preferred embodiments of the invention are to provide a method of expressing a 
DNA in an embryonic cell in a more stable and more homogenous manner than 
hitherto attainable, and to provide for stable transfection of embryonic cells at a 
higher frequency than can be obtained using conventional vectors. 

The invention is based upon the maintenance of a vector within a cell, wherein 
maintenance of the vector is dependant upon the continued presence within the cell 
of a certain factor and wherein that factor is not expressed by the vector but is 
produced in or present in the cell in an amount sufficient to maintain the vector. 

Accordingly the invention provides a transfection and expression method comprising, 
in a cell that expresses or will express a replication factor, introducing a vector 
dependant upon that replication factor. Thus, in a first aspect, the invention provides 
a method of expressing a DNA in a ceil, comprising: 

(a) (i) transfecting the cell with a first vector that expresses a 
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repiicaiion factor; or 
(ii) otherwise obtaining a cell that expresses or will express the 
replication factor; 

and 

(b) transfecting the ceil with a second vector, wherein 

(i) the second vector contains a DNA, or is adapted to receive a 
DNA, in operative combination with a promoter for expression of 
the DNA; and 

(ii) extrachromosomal replication of the second vector is dependant 
upon presence within the cell of the replication factor. 

The replication factor is optionally non-toxic to the cell. Alternatively, the replication 
factor is toxic to the cell at high levels of expression but at low levels of expression 
is substantially non-toxic to the cell but at these low levels is present in sufficient 
amount to enable replication of the second vector. 

Further, the replication factor preferably does not after the ability of the cell to 
differentiate or proliferate, and may thus be regarded as being neutral to the cell 
phenotype. This enables the activities of the product of a cDNA to be investigated 
over a long time period and many cell generations without having to take account of 
possible interfering effects of the replication factor present within the cell. Again, the 
replication factor may be phenotype-neutral -at all levels or may be neutral at a low 
level which is nevertheless a sufficient level to maintain the second vector within the 
cell. 

The invention is of application to all cell types for which there exists, whether from a 
natural or synthetic source, a replication factor capable of maintaining in that cell type 
an episomal vector. The vector is preferably stably maintained, meaning it is 
maintained over a number of cell generations, and at least over 3 generations. The 
ceil is preferably selected from the group consisting of mammalian cells, in particular 
primate cells or murine cells, and avian cells. It is further preferred that the cell is an 
embryonic cell, in particular an ES, EC (embryonic carcinoma) or EG (embryonic 



WO 9832868 



PCT/GB98/00216 



gonadal) cell, or differentiated progeny of any such cell. 

While reference is made to the second vector, it will be appreciated that the 
replication factor is optionally present in the cell other than following transfection with 
a first vector. For example a culture of cells that already express the replication factor 
may be obtainable from a third party. 

In an embodiment of the invention described in detail below, the method comprises 
iransfecting an ES cell with a first vector that expresses a viral replication factor, and 
thereafter transfecting the ES cell with a second vector that expresses a cDNA and 
is dependant upon presence of the viral replication factor for its extrachromosomal 
replication within the ES cell. The frequency of the first transfection step is generally 
low and may result in as few as 1 in 10 5 successful stable transfectants - this level 
of success is recognised as typical in this art. However, the second transfection has 
surprisingly and advantageously found to result in a significantly higher frequency of 
successful stable transfectant colonies being obtained. The second transfection can 
be carried out with a 1% or higher success rate, which represents a 100-fold 
improvement over the art. 

One suitable viral replication factor for mouse cells, in particular mouse ES cells, is 
polyoma large T antigen, in which case the cell of step (a) expresses the polyoma 
large T antigen and the second vector comprises an origin of replication that binds 
the polyoma large T antigen, such as the polyoma replication origin, referred to as 
Ori. Another suitable viral replication factor for primate cells is based upon Epstein 
Barr virus, in which the primate cell of step (a) expresses the EBNA-1 antigen and 
the second vector comprises an origin of replication that binds EBNA-1 , such as OriP. 
Viral replication factors are generally species - specific and so expression of DNA 
according to the invention is dependent upon choice of a replication factor appropriate 
to the' cell. Polyoma large T has been described for use in mouse cells. E/JNA-1 is 
suitable for human cells. Still further systems are optionally based on papilloma virus 
replication factors, for human cells, or SV40 virus large T antigen, for simian cells, 
and further suitable replication factors may also be selected from functional variants, 
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cenvatives and analogues of these replication factors, such as temperature sensitive 
variants. 



In use, the second vector is constructed according to standard techniques so as to 
contain a cDNA sequence or insert of interest operativeiy combined with a promoter 
io express the cDNA. The second vector is used to transfect an ES cell already 
expressing a replication factor and successful transfecianis are recovered in which 
it is found that the second vector is stably maintained within the ES cell and 
expresses the cDNA with a more homogenous pattern than when prior art techniques 
are followed. Thus, the invention provides an advantageous method for expression 
of a cDNA in a cell. 

in this context, "homogenous" in relation to expression of a cDNA in a colony of 
iransfected ES cells is used to indicate that most cells, or a large proportion of cells, 
or preferably most cells, or more preferably substantially all cells, express the cDNA 
and "stable" is used to indicate that the cells continue to express the cDNA at a 
similar level and preferably at substantially the same level. In the examples carried 
out to date and described below, homogenous transfeciion is seen with the method 
of the invention to a greater extent than in the art methods. Also, in the examples 
carried out to date and described below the method results in more stabie expression, 
meaning that expression does not alter over time. This has the advantage that study 
of the long term effectsof a cDNA product is facilitated. 

It is optional for the cell of step (a) first to be obtained or prepared by transfection of 
a cell by a first vector and for this then to be used for the starting cells for carrying 
out a plurality of separate transfections by second vectors containing different DNA 

•inserts coding for different DNA products of interest. Following this procedure, the first 
transfeciion may be carried out with the level of success typically seen in 

conventional techniques and the ES cells obtained divided into separate colonies, 
i ne second transfections, introducing the DNA insert in the second vector, are then 

earned out with the higher levels of success typically seen in the methods of the 

invention. 
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In the case that the method comprises transfection with first and second vectors, it 
is preferable for the first vector to code for a selectable marker and for the second 
vector also to code for a selectable marker, though a different one: In a specific 
embodiment of the invention described befow, the first vector codes for hygromycin 
resistance and the second codes for neomycin resistance. This allows selection of 
ES cells in which transfection by both first and second vectors has been successful. 

It is a further embodiment of the invention for the method to comprise an additional 
transfection step with a third vector, wherein the third vector contains a cDNA, or is 
adapted to receive a cDNA, in operative combination with a promoter for expression 
of the cDNA t and extrachromosoma! replication of the third vector is dependant upon 
presence within the ES cell of the replication factor. Transfection with the third vector 
is optionally at the same time as transfection with the second vector or subsequent 
thereto. 

yhe second and third vectors preferably each comprise a selectable marker enabling 
selection of ES cells in which transfection has been successful. The respective 
selectable markers are preferably different if the method comprises transfection with 
both second and third vectors, and preferably different again from the selectable 
marker of the first vector. 

It is a feature of particular embodiments of the invention that the second vector (and 
third or subsequent vectors if present) are not able to express the replication factor. 
In fact, in construction of the second vector from a vector comprising DNA encoding 
the replication factor it is preferable for that DNA to be largely or substantially 
completely deleted. 

in a specific embodiment of the invention, the first vector is pMDG20neo and 
expresses polyoma large T antigen and the second vector comprises the natural 
target for polyoma large T antigen, namely Or/, expresses a cDNA of interest but 
does not express large T antigen, in use, the large T antigen is expressed by the 
first vector and binds to Ori of the second vector when it enters an ES cell, thus 
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enabling replication of the second vector and its maintenance within the ES ceil in an 
extrachromosomal state, in successful transfectants, the vector remains 
extrachromosomal, and this is beiieved to render the vector relatively immune from 
etiacis seen when a vector is integrated into the host ES cell genome, which effect 
may include silencing of the cDNA resulting in unstable and heterogeneous 
expression. 

An alternative to use of the first episomal vector is to introduce into the cell a 
construct that expresses the replication factor and integrates with the cell genome. 
The construct should therefore include a DNA sequence coding for the replication 
factor and means for selection of ceils in which the construct has successfully 
integrated; one example is a construct that comprises cDNA coding for, in order, large 
T antigen - an internal ribosome entry site (IRES) - Bgeo. A culture of cells is then 
obtained by selecting for cells that express the selectable marker, such as in this 
case by selection in G418. Staining with Xgal is used to identify transfectant clones 
which show stable and homogenous expression. The construct preferably comprises 
a promoter that gives stable, low level expression in transfected cells, such as the 
HMGCoA promoter for ES cells. The ceils obtained can then be subjected to 
transfection with the second and optionally third and subsequent vectors. 

in another embodiment of the invention the second vector comprises an inducible 
promoter. Some types of differentiated cells, derived from ES cells, can only be 
obtained with any reliability if a particular differentiating factor is expressed after a 
prior event. One example is neurone formation which generally only occurs after 
aggregation of cells. Thus, using an inducible promoter, expression of DNA that 
codes for the factor that leads to neurone formation can be controlled until the ES 
cells have suitably aggregated. Interferon responsive promoters are some examples 
of inducible promoters. Alternatively, the cDNA is designed to be in a non-functional 
form and to be capable of being modified into a functional form at a later time. One 
possibility is for the cDNA to be disrupted for example by termination sequences 
which are flanked by target sites for a site specific recombinase, such as loxP sites, 
removable by Ore recombinase, or frt sites removable by Flp recombinase. Ore and 
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FIp can be fused to steroid hormone receptors in order to make their activity 
regulatabie. After administration of steroid the Cre or rip recombinase will translocate 
to the nucleus and there convert the cDNA into a functional form by excision of the 
disrupting sequence. It may also be desired to stop or inhibit or reduce replication 
of the second vector; the method optionally comprises using a site specific 
recombinase to present replication of the second vector. This can be achieved by 
deletion of a sequence in the vector to which the replication factor must bind in order 
for the vector to be replicated by the host cell. 

The term DNA or cDNA is usually understood to refer to a DNA sequence that is 
transcribed into a mRNA that is translated into a polypeptide or protein. In the present 
invention the term is also intended to encompass any product" of DNA expression. 
It thus includes DNA coding for an antisense RNA t or for an antisense ribozyme 
molecule. 

The method of the invention is suitable for assaying effects of DNA expression, due 
to the stability and efficiency of expression achievable. Accordingly, the invention 
further relates to an assay for the effect of presence in a ceil of any product of DNA 
expression - such as protein, polypeptide, antisense RNA, ribozyme RNA, transfer 
RNA or other. The method comprises steps (a) and (b) as described above wherein 
the second vector also contains a DNA coding for a selectable marker. The method 
further comprises selecting for cells that have been transfected with the second 
vector and maintaining the selected cells over a plurality of generations. 

Step (a) may be carried out once and then steps (b) onwards repeated for different 
assays, and the method is of particular application to screening a cDNA library. 
Furthermore, two or more cDNAs can be expressed in the same cell to assay the 
effect of the combination of their respective expression products. 

The invention also relates to a vector. Accordingly, the invention provides, in a 
second aspect, a vector for transfection of an ES ceil, wherein: 

(i) the vector contains a DNA, or is adapted to receive a DNA, in operative 



WO 98/3286$ 



PCT/GB98/00216 



combination with a promoter for expression of the DNA; 

(ii) exirachromosomal replication of the vector is dependant upon presence 
within the ES cell of a replication factor; and 

(iii) the vector does not express the replication factor. 

The vector is characterized in preferred embodiments as described above in relation 
to the second vector of the first aspect of the invention. 

it is an advantage of at least preferred embodiments of the invention that due to very 
high efficiency of stable secondary transfection (supertransfection) of cells, for 
example transfection of ES cells harbouring pMGD20neo with a second plasmid 
containing the polyoma replication origin (Ori) (8), that expression of DNA is stably 
and efficiently achieved from the second plasmid. 

Another aspect of the present invention provides a method of screening for new 
DNAs that encode signal sequences and proteins that are transported to the cell 
surface. The invention according provides a method of investigating the properties 
of a DNA sequence comprising expressing in a cell a composite DNA including (a) 
the DNA sequence under investigation, linked to (b) a DNA coding for a cell active 
protein, wherein 

activity of the cell active protein is dependant upon transport of the cell active 
protein to the cell surface, and 

the DNA of (b) does not code for a polypeptide capable of directing 
transportation of the eel! active protein to the cell surface. 

This offers the advantage that where the DNA of interest does indeed code for a 
sequence that transports a polypeptide to the cell surface, whether thai polypeptide 
remains there or is ultimately secreted, this will be apparent from observation that the 
cell active protein has had or is having its known effect. Thus the method offers a 
convenient means of identifying DNA sequences that will transport proteins to the cell 




PCT/GB98/00216 



The method is suitably used for screening a library of DNAs to identify DNA 
sequences coding for signal polypeptide sequences that transport proteins to the cell 
surface. The cell active protein if transported to the cell surface may remain there 
or be secreted by the cell, and this distinction may be separately assayed, or 
example by examination of the make-up of the culture medium before and after the 
investigation. 

One convenient way to obtain the DNA of (b) is by deleting or disabling, from a DNA 
encoding a cell surface or secreted protein, that portion of the DNA that codes for the 
polypeptide sequence responsible for transportation of the protein to the cell surface. 
The cell active protein is optionally a cell surface receptor and the DNA of (b) can 
thus encode a modified form of the receptor preprotein lacking a functional signal 
sequence. In a specific embodiment described below the IL-6 receptor is used as 
expression of the receptor in ES cells can be used to inhibit differentiation of the cells 
- a readily observable property of the cell active protein. Gross morphological or 
proliferative changes induced in the cell by the cell active protein are of course readily 
observed, though the invention is of application to any cell active protein whose 
activity, when it is transported to the cell surface and / or secreted, can be assayed. 

A specific embodiment of this aspect of the invention comprises expressing the 
composite DNA by: 

(a) (i) transfecting a cell with a first vector that expresses a replication 

factor; or 

(ii) otherwise obtaining a cell that expresses the replication factor; 

(b) transfecting the cell with a second vector, wherein 

(i) the second vector contains the composite DNA in operative 
combination with a promoter for expression of the composite 
DNA; 

(ii) the second vector also contains a DNA coding for a selectable 
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marker in operative combination with a promoter for expression 
of the selectable marker; and 



(iii) extrachromosomal replication of the second vector is dependant 
upon presence within the cell of the replication factor; 

(c) selecting for cells that have been transfected with the second vector; 
and 

(d) maintaining the selected cells over a plurality of generations so as to 
assay the effect of expression of the composite DNA. 

If many investigations are to be carried out it is preferred that step (a) is carried out 
once and the cells obtained are divided and used for a plurality of separate methods 
in which steps (b)-(d) are carried out a plurality of times With second vectors 
containing different DNA sequences. This offers the advantage that typically the first 
transfection step is of lower efficiency than the second, so the method avoids having 
to repeat the low efficiency step too often. 

It is particularly preferred that the method is used for identification of a DNA coding 
for a cell surface or secreted protein, and using the method to screen a library of 
DNAs provides a means of carrying out the screen for discovery of such DNAs and 
investigation of their properties. More especially, the method is for discovery of 
hitherto unknown or un characterized cell surface or secreted proteins, or for location 
of the coding sequence of known proteins of this type. 

This aspect of the invention optionally further incorporates in preferred embodiments 
features of transfection of cells described above in relation to other aspects of the 
present invention. 

i he invention enables development of a series of vectors which give highly efficient 
and robust expression of transgenes in cells. Cloned cDNAs of interest can rapidly 
be characterised using this system. It is also applicable to the discovery of novel 
regulatory molecules through functional expression screening of cDNA libraries. 
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Due to their pluripotent and proliferative character, key cellular processes such as 
viability, propagation, determination and differentiation, can be analyzed in transfected 
ES cells. The "supertransfection" system of the invention overcomes the limitations 
associated with conventional cDNA transfection and opens a powerful new route to 
gene discovery and characterisation in mammals. 

Key features of the episomal supertransfection system, described according to the 
examples below, are that very high efficiencies of stable transfection are obtained and 
that cDNA expression is homogeneous, stable and reliably dictated by promoter 
strength. The increased efficiency of isolating stable iransfectants is significant 
because it allows reliable detection of cDNAs whose expression results in cell death 
or differentiation. In addition a high transfection efficiency is generally advantageous 
for any high throughput assay system and is essential for functional cDNA library 
screening. The reliability of cDNA expression is critical for functional studies and the 
robust nature of expression from episomal vectors contrasts favourably with the 
variable and unstable expression observed in conventional ES ceil transfectants. 

Heterogeneous expression of integrated transgenes is not an artefact arising from use 
of bacterial lacZ as a reporter gene, firstly because similar observations have been 
made using mammalian thy-1 as a reporter in F9 cells, and secondly because 
ubiquitous expression of lacZ can readily be obtained following gene trap integrations 
(23,24). The expressiorf pattern throughout the population cannot be determined by 
Northern blot but can only be revealed by in situ hybridization or use of a linked 
reporter gene such as IRES-/acZ (25) Heterogeneous expression, which previously 
occurred in the great majority of transfected clones following stable integration, gave 
unclear or misleading results on the phenotypic consequences of transgene 
expression. 

The difference in expression pattern between conventional transfectants and episomal 
supertransfectants of the invention arises because an extrachromosomaL copy of a 
transgene is not subject to alteration during the integration process norxo modification 
arising from the genomic sequences flanking an integration site. The so-called 
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"position effect" can modify both ine level and pattern of transgene expression in 
stable transfectants. Furthermore, the expression of integrated iransgenes is often 
suppressed over several generations in ES ceil cultures. This silencing phenomenon 
contributes to the high backgrounds which can be obtained in double replacement 
type targeting strategies (26) . It has been observed in stable transfectants with 
different iransgenes driven by virai promoters or minimal mammalian promoters such 
as the widely used human /?-actin and mouse PGK-1 promoter elements. One 
hypothesis to explain this phenomenon is that transgenes may become targets of de 
novo methyltransferase in stem cells (27). Macleod et al. (28) reported that a 
methylation free locus could be generated in transgenic mice by introduction of the 
whole CpG island of the aprt promoter, 

•% 

Whatever the molecular mechanism of silencing, it appears not to occur to episomaiiy 
maintained transgenes in vectors of the invention. In addition, the level of expression 
obtained from vectors of the invention is reliably dictated by promoter strength and 
can predictably be varied over at least a 10-fold range by appropriate choice of 
promoter. Episomal constructs of the invention thus offer considerable advantages 
for functional expression studies in ES cells. 

Functional cDNA expression cloning is a powerful method for direct isolation of 
important genes. The expression screening approach has often been employed to 
isolate cDNAs encoding surface and secreted molecules via transient expression, for 
example in COS cells. In a few cases E3V-based systems have also been applied 
to isolate intracellular regulatory genes via stable expression in the target ceils (29- 
32) . The high efficiency of supertransfection in the polyoma system of the invention 
indicates that this approach could be applied to functional cloning in ES cells. Based 
on a transfection efficiency of 2.5%, a library of 5x1 0 5 cDNA clones could be 
screened by electroporation of 2x1 0 7 cells with 100//g DNA. For an effective library 
screen, the majority of transfectants should only take up a single piasmid. It is also 
advantageous if the cDNAs can readily be recovered in unrearranged form. Both of 
these conditions are satisfied by the episomal supertransfection system. By 
screening libraries prepared from undifferentiated ES cells it may be possible to 
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isolate cDNAs whose products mediate self-renewal. In this case direct selection can 
be applied for colony formation in the absence of LIF. For cDNAs whose products 
direct differentiation, however, it will be necessary either to screen poofs through 
several rounds or to incorporate an inducible promoter into the episome. 

Recently, several improved protocols for in vitro differentiation of ES cells have been 
reported, which promote efficient generation of, for example, haematopoietic celis (33) 
, neurons (34) or cardiomyocytes (35). The episomal expression strategy of the 
invention can be applied for gain-of-function assays and screens during these 
differentiation programmes. It can also be used for loss-of-function analyses via 
overexpression of anti-sense RNA or dominant-negative mutants. Combination of 
these differentiation systems with the episomal expression ^system will therefore 
provide powerful tools for analysing cell determination and differentiation events. 

The invention is now described with reference to the accompanying drawings in 
which: 

Fig. 1 shows the structure of the episomal expression vector pHPCAG; 
Fig. 2 shows supertransfection efficiency of pHPCAG in MG1.19 ES cells; 
Fig. 3 shows DNA hybridisation analysis of Hirt supernatants from 
s u pe rt ra n sfe ctants ; 

Fig. 4 shows the effect of vector size on supertransfection efficiency; 
Fig. 5 shows expression of /?-galactosidase in MG1.19 tran sfe ctants; 
Fig. 6 shows the restriction pattern of plasmid DNAs recovered from 
pHPCAG-/acZ supertransfectant clone; 

Fig. 7 shows induction of differentiation by expression of STAT3F in MG 
1.19 ES celis; 

Fig. 8 shows co-supertransfection of STAT3F with wild type STAT 
expression vectors; 

Fig. 9 shows linker sequences for use in an assay of the invention; 

Fig. 10 shows DNA sequences coding for truncated and modified IL6R; and 

Fig. 11 shows a vector for use in an assay of the invention. 
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lore cetan: 



Figure 1 shows the structure of the episomal expression vector pHPCAG. cDNAs can 
be introduced between two BstXl sites using BstXl adaptors. Abbreviations: ALT20: 
deleted polyoma large T expression cassette LT20; Pyori/enh: mouse polyoma virus 
replication origin and mouse polyoma mutant enhancer derived from F101 strain; 
SVpA: SV40 poiyA addition signal; PGKhphpA: hygromycin B phosphotransferase 
gene expression cassette with mouse phosphogiycerokinase-1 (PGK) promoter and 
poiyA addition signal; CAG: combined CAG expression unit; yS-globinpA: rabbit /?- 
giobin poiyA addition signal; SVori: SV40 replication origin; ColElori: ColEl 
replication origin; amp: £. coli /^lactamase gene conferring resistance to ampiciilin. 

Figure 2 shows supertransfection efficiency of pHPCAG in MG1.19 ES cells. 

(A) shows numbers of transfectant colonies per microgram of pHPCAG DNA. 5x1 0 5 
MG1.19 ES cells were supertransfected with the indicated amounts of supercoiied 

pHPCAG followed by selection with hygromycin B for 8 days. The resulting number 
of drug-resistant colonies were scored and efficiency per fjg DNA calculated. 

(B) shows total numbers of transfectant colonies plotted against total amount of 
plasmid DNA. 

Figure 3 shows DNA hybridisation analysis of Hirt supernatants from 
supertransfectants. Hirt supernatants were prepared from 5x1 0 5 parental MG1.19 
cells and pooled pHPCAG supertransfectants. 1/20 of each sample was digested 
with either Eco Rl or HindUl and analyzed by filter hybridisation using a 344bp Sea 
\Sspl fragment from pUC19 which is common to both pMGD20neo and pHPCAG.. 

Figure 4 shows the effect of vector size on supertransfection efficiency. 20^/g of each 
of the supercoiied vectors pLT20ANde\hph (4.7), pLT20ABs£X]/7p/7 (5.5), 
pLT20AAIwN\hph (5.6), pLT20ASaclhph (5.9), ptkp (6.2), pSV40e/p (6.4), 
PGK/7p/7ALT20 (6.5), pmPGKp (6.5), phBAp (6.6), pHPCAG (7.7), ptk^lacZ (8.9), 
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pSV40e/p-/acZ (9. 1 ), orrPGKp-lacZ (9.3), phBAp-/acZ (9.3), and ph'?CAG-/acZ (1 0.4) 
were individually supertransfecied into 5x1 0 5 MG1.19 ES cells. The resulting 
numbers of hygromycin 3 resistant colonies were scored after 8 days, Transfection 
efficiencies are normalised relative PGKhohALT20. 



Hcure 5 shows expression of /?-galactcsidase in MG1.19 transfectants. Primary 
colonies were stained with Xgal after 8 days of selection. 



(A) shows typical homogeneous staining pattern obtained following supertransfection 
with supercoiied pHPCAG-/acZ. 

(S)shows heterogeneous staining pattern obtained in minority of clones following 
supertransfection with supercoiied pHPCAG-/acZ. 



(C) shows heterogeneous staining pattern typically observed following electroporation 
of linearized pHPCAG-/acZ and stable integration. 

(D) shows rare faint staining pattern obtained after supertransfection with supercoiied 
pHPCAG-/acZ. 

Figure 6 shows the restriction pattern of plasmid DNAs recovered from pHPCAG-/acZ 
supertransfectant clone. 



A supertransfectant MG1.19 clone carrying pHPCAG-/acZ was cultured for 60 days 
in the presence of hygromycin B. Hirt DNA was then prepared " and 
electrotransformed into E.coli DH10B cells. Plasmid DNAs were recovered from 
transformants, digested with EcoR\, resolved by electrophoresis on 1.0% agarose gel 
and visualised by ethidium bromide staining. Expected fragment sizes: pMGD20neo, 
4852bp and 2884bp; pHHPCAG-/acZ, 3S97bp, 2810bp t 733bp and 397bp. Lane 1: 
size marker (1kb ladder:3RL); lane 2: control pMGD20; lane 3 : control pHPCAG- 
/acZ; lane 4: recovered pMGD20; iane 5,5: recovered pHPCAG-/acZ. 
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riGure 7 snows induction of differentiation by expression of STATS F in MG 1.19 ES 
cells. 

(A)shows proportion of differentiated colonies in LIF-supplemented medium resulting 
from supertransfection of STAT3, antisense STAT3 and STAT3F expression vectors. 
Colonies were fixed and stained with Leishman's reagent after 8 days selection and 
numbers of stem ceil coionies and differentiated colonies scored. 

(3) shows marker gene expression in STAT3F supertransfectants: Expression of 
marker genes in pools of MG 1.1 9 cells supertransfected with STAT3 (lane 1), STAT3 
antisense (lane 2) and STAT3F (lane 3) expression vectors. Total RNA was prepared 
after 8 days of selection in LiF-supplemented medium and 5/yg^aliquots analyzed by 
filter hybridisation with /?-globin, Rex-1, H19 and G3PDH probes. The/?-gIobin probe 
detects all transgene mRNA species generated from pHPCAG, including an 
alternatively spliced product from the antisense construct. 

(C)shows photomicrographs of representative colonies 8 days after supertransfection 
with (i) STAT3, (ii) STAT3F, and (iii) empty expression vectors and selection in the 
presence of LIF, or, (iv) induction of differentiation by culture in the absence of LIF 
for 8 days. 

F ' aure 8 shows co-sup-ertransfection of STAT3F with wild type STAT expression 
vectors. Proportions of undifferentiated stem cell colonies generated after co- 
supertransfection of MG1.19 ES cells with 10//g p3PCAGGS-STAT3F plus 10//g 
pHPCAG vector containing stuffer (control), STAT3, STAT1 or STAT4 inserts. After 
8 days selection with 80//g/ml of hygromycin 3 plus 20/yg/mi of blasticidin S, colonies 
were fixed and stained with Leishman's reagent. 
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EXAMPLE 1 

Materials and Methods 

Vector constructions. 

Standard recombinant DNA methods were used to construct a!! plasmids(10) . 
Plasmid pHPCAG (Fig 1) was constructed from P MGD20neo(8) . The PGKneopolyA 
s««u-nce was replaced by a hygromycin resistance marker, PGKhpnpA, and large 
7 s-cuences were deleted (see Results). A Sa/l-Scal fragment containing the CAG 
-xor-ssion unit, a SsfXI stuffer sequence, the polyA addition signal derived from the 
rabbit /J-giobin gene and an SV40 replication origin (11) was inserted. Cod.ng 
sequences for ^-galactosidase. LIF or interieukin-2 were introduced between the 
BstXI sites. 

For construction of episomal expression vectors with alternative promoters, the Sa/I- 
Xbai fragment containing the CAG expression unit in pHPCAG-ZacZwas replaced wrth 
the 344 bp SV40 enhancer/promoter (SV40e/p), the 466 bp human /?-actin promoter 
(hBA) the 502 bp mouse phosphoglycerate kinase promoter (mPGK) and the 90 bp 
HSV-tk minimal promoter (tk), resuKing in P HPSV40e/p-/acZ. pHPhBA-/acZ, 
pH?rn p G K-/acZ and P HPtk-/acZ, respectively. 

=oisomal vectors with alternative selection markers were constructed by replacing the 
PGKhpnpA cassette in pHPCAG with the SVosrpA- cassette caring the E.col, 
biasticidin S deaminase (bsr) gene denved from pSV2bsr (Waken Seiyaku) or the 
hCWVzeopA cassette carrying the Srrepfoa/fofefctos bleomycin resistant gene (Sh 
We) derived from pZeoSV (InvSrogen) to generate oBPCAGGS and pZPCAGGS. 
respectively. 

Cell culture and transfection. 

M r-1 19 ES cells are derivatives of the CCE line which stably maintain around 20 
ooisomal copies of pMGDneo(S) . They were maintained on gelatin-coated plates m 
Glasgow modified Eagle's medium (GMEM. Gibco-3RL) supplemented wrth 10% fetal 
' cal- serum, 0.1 mM /Nmercaptoethanol. non-essential amino acids. 200 „g/ml G418, 
and lOOU/ml LIF oroduced in COS-7 ceils(11,12) . For superinfection, routmely. 
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5x1 MG1.19 ceils were suspended in 800 fj\ of P3S, incubated with 20 /jg of 
superceded vector DNA for 10 min on ice, and electroporaied at 200V/960 > uF using 
a 3io-Rad gene pulser. Cells were transferred into gelatinized plates and allowed to 
recover overnight before addition of appropriate selection agent. Histochemical 
staining for /?-galactosidase was carried out with 5-bromo-4-chloro-3-indolyl J3-D- 
galactopyranoside (X-ga!) (13) , and /?-gaiactosidase activity was measured by 
incubation of cell extracts with o-nitrophenyl-/?-D-galactopyranoside (ONPG). 
Differentiation was induced in monolayer culture as described (12) . 

Analysis of episomal vectors in the supertransfectants. 

Hint supernatants were prepared as described (14) . For amplification of recovered 
episoma! vectors, electrocompetent E. colt DM 103 cells were transformed by 
eiectroporation at 2500V/25//F/200!4. 

Results 

Construction of an episomal expression vector. 

Poiyoma-based plasmids have recently been reported to be competent for episomal 
propagation in ES cells (8) . The plasmid pMGD20neo contains a modified large T 
expression unit called LT20, the viral origin of replication (Or/"), and the PGKneopA 
cassette as a selectable marker. This plasmid can be maintained as an 
extrachromosomal element in wild-type ES cells. It can be modified to include a 
cDNA expression unit (9) . However, the low frequency of conventional stable 
transfection of ES cells (A 1 x 10°) remains a limiting feature. Furthermore, episomal 
propagation only occurs in 10-15% of primary transfectants (8,9) . 

A second plasmid has been described which can be maintained as an episome only 
in ES cells which independently* express the large T protein (8) . This plasmid, 
PGKhphALT20, contains LT20 with a large deletion in its coding sequence, On\ and 
PGKhphpA as a selectable marker. When introduced into a cell line such as MG1 .1 9, 
in which episoma! maintenance of pMGDneo has already been established, the yield 
of hygromycin B resistant stable transfectants is extremely high. This phenomenon 
of supertransfection is presumed to arise from the pre-existence of large T protein in 
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tne reciDien: cells. 



In the studies reported below the modification and use of supertransfection vectors 
for cDNA expression is characterised. 

Size of vector 

PGKhph ALT 20 retains part of the large T coding sequence. We made a series of 
deletions in the ALT20 sequence to minimize the vector size and thereby increase 
the capacity for inserts and reduce potential bias in the construction and screening 
of cDNA libraries. The supertransfection efficiency of four derivative plasmids was 
then compared in MG 1.1 9 cells. All showed comparable supertransfection efficiency 
to PGK/>pA7ALT20 (data not shown). The smallest, pLT20AA/delnp/). has a deletion 
of 2953 bp, yielding an episomal vector backbone of only 4.7kb. 

Expression unit 

Into this minimal episomal vector we introduced a cDNA expression unit 
Transcriptional initiation signals are supplied by the CAG cassette(11) , which 
comprises the human cytomegalovirus immediate early enhancer, a 1Kb fragment of 
the chicken £-actin gene (promoter, non-coding first exon and first intron). and a 
splice acceptor derived from the rabbit /?-globin gene. This combination has been 
shown to direct strong expression of cDNAs in undifferentiated stem cells. The 
resulting expression vector, pHPCAG (Fig 1), contains the CAG sequences followed 
by the SsrXI stuffer sequence derived from pCDM8 as a cDNA cloning site, and a 
polyA addition signal derived from the rabbit yS-globin gene. In addition the plasmid 
contains the PGKhphpA (15) cassette for hygromycin selection of.ES cell 
transfectanis, the polyoma Ori with pyF1 01 -derived mutant enhancer element (16) for 
stable episomal replication in cells expressing polyoma large T protein, and the /?- 
lactamase (amp) gene and prokaryotic replication origin for amplification in E. coh. 
The SV40 Ori is also present to allow for transient episomal replication in mammalian 
host cells expressing SV40 largeT, such as COS cells (17) . 



W O 98/32868 PCT/GB98/00216 

- 21 - 

Characterization of supertransfection. 

The parameters of supertransfection with pHPCAG and derivatives were investigated. 
First, 5x1 0 5 MG1.19 ceils were eieciroporated with various amount of supercoiled 
pHPCAG, selected in medium containing 80 //g/ml of hygromycin B for 8 days, and 
the number of stem cell colonies scored after Leishman's staining(12) . Although the 
highest efficiency per fjg DNA was observed with minimum amounts (1-2 fjg) of 
vector DNA (Fig. 2B), the total yield of hygromycin B resistant colonies increased with 
increasing amount of plasmid (Fig 2A). Saturation was not reached over the range 
of plasmid concentrations tested. With 100 fjg plasmid DNA, 150,000 hygromycin 3- 
resistant colonies were obtained, representing 3% of total treated cells. Disablement 
for episomal replication by linearisation of pHPCAG prior to electroporation reduced 
this transfection efficiency to less than 0.01%. 

Next, increasing numbers of MG 1.19 cells were subjected to electroporation with 100 
fjg of pHPCAG DNA. Comparable stable transfection efficiencies in the range 3-6% 
were obtained with up to 2.5x1 0 7 cells. 

The copy number of pHPCAG in the supertransfectants was analyzed by preparation 
of Hirt supematants followed by filter hybridisation. This analysis revealed that 
supertransfected cells carried approximately 20 copies each of pMGDneo and 
pHPCAG (Fig. 3). 

These data demonstrate that the efficiency of supertransfection with pHPCAG is 
extremely high. However, episomal vectors can be limited in their capacity for inserts 
because increased size may cause inefficient replication or instability. To investigate 
this issue in the ES ceil system, episomal vectors of different size were 
supertransfected into MG 1.19 ceils. The numbers of supertransfectant colonies were 
scored and plotted against vector size (Fig. 4). These data indicate that there is a 
progressive reduction in transfection efficiency with increasing plasmid size. In 
particular, the largest plasmid tested, a derivative of pHPCAG with a 3kb lacZ insert 
(total size 10.4kb) showed a 50% reduction in colony number. However, that this 
may not be due entirely to the size of the plasmid because the very high levels of /?- 



WO 98/32868 



# 




PCT/GB98/00216 



- 22 - 



calactosidase expression may exert s 



cme toxic effects (see below). 



lacZ expression in supertransfectants. 

To evaluate the level and pattern of expression of transgenes from pHPCAG, the 
E.coli £-galactosidase (lacZ) gene was introduced into this vector. The resulting 
vector, pHPCAG-/acZ, was introduced into MG1.19 cells and supertransfectants 
isolated by selection with 80 //g/ml of hygromycin 3 for 8 days. The number of 
colonies isolated was 50% of the number obtained in a parade! supertransfection with 
pHPCAG (see above). The colonies were smaller and many of the cells showed an 
abnormal spindle-shaped morphology. These effects were not observed with several 
other inserts in pHPCAG and are suggestive of a toxic effect of the high level lacZ 
expression. The primary supertransfectants were stained with X-gal and the staining 
pattern examined under phase-contrast microscopy. Staining was detectable after 
5 minutes incubation and was intense by 1 hour. This level of /?-galactosidase 
activity is significantly higher than we have observed from a variety of integrated 
expression constructs. 

Approximately 80% of supertransfectant colonies showed ubiquitous expression 
(>90% eel! positive) as shown in Fig.5-A (i). Of the remainder, 15% showed 
heterogeneous expression (Fig.5-A (ii)), and 5% showed little or no staining (Fig.5-A 
(iv)). The latter two classes are likely to arise as a result of vector integration which 
occurs in up to 20% of supertransfectants (8). In transfectants derived by 
electroporation of linearized pHPCAG-/acZ into MG1 .19 cells (which results in vector 
integration in the majority of clones), only 15 % of colonies showed homogeneous 
staining whereas 70% of colonies stained heterogeneously (Fig.5-A (iii)), and 15% 
showed no expression. 

Analysis of expanded clones from each class of transfectant established that this 
difference in expression characteristics was stable. Twelve of '13 expanded 
supertransfectants expressed lacZ homogeneously. In contrast, only 4 out of 24 
clones derived using linearized vector showed homogeneous expression. This is 
consistent with our previous observations on integrated expression constructs in ES 
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cells. In fact the CAG unit gives a significantly higher frequency of colonies which 
show stable ubiquitous expression than other promoters we have examined. 



The difference in staining pattern between episomally maintained and integrated 
vectors indicates that the former escape modifying influences arising from integration 
and reliably give full activity of the expression unit. 

Comparison of expression with various promoters on episomal vector. 

An ability reliably to generate predetermined levels of expression would be a 
important attribute for a transgene expression system. The previous observations 
suggested that episomal vectors offered potential to achieve unmodified expression. 
Various promoters with different strengths in undifferentiated stem cells were 
therefore introduced into the episomal vector by replacing the CAG expression unit 
of pHPCAG-/acZ. Expression of the lacZ reporter was then assayed in both transient 
and stable supertransfectants (Table 1). The relative ratio of j!?-galactosidase activity 
obtained from the SV40 enhancer/promoter complex, the human /?-actin promoter, 
the mouse PGK-1 promoter and the HSV-tk minimal promoter in transient transfectant 
was retained in stable supertransfectants. The CAG expression unit showed 
strongest activity in the tested constructs in both transient and stable transfectants. 
In this case, however, the relative ratio in transient transfectants, 1 9 times higher than 
SV40, was significantly reduced in stable transfectants. This may arise from an 
elimination of strong expressants due to a toxic effect of high lacZ expression (see 
above). A reduced number of supertransfectants and smaller size of colonies was 
observed only with the CAG vector. 

Stability of supertransfected episomal expression vector during long-term 
culture and differentiation of host cells. 

A critical limitation of previously described episomal vectors is their instability during 
long-term culture. Many episomal vectors undergo integration into the host genome 
after long-term culture, resulting in a reduction in expression and inability to recover 
transgenes by preparing Hirt supematants. To test the stability of the 
supertransfection system, four pHPCAG-/acZ supertransfectant clones were cultured 



WO 9832868 PCT/GB98/00216 



for 50 days (approximately 90 generations) under continuous selection with 80 //g/ml 
of hygromycin B. Three of the four clones maintained relatively constant levels of /?- 
galactosidase activity determined by ONPG assay and uniform expression as 
revealed by Xgal staining. The fourth clone showed unstable and variegated 
expression, as commonly observed on vector integration. Hirt supernatants were 
prepared from one of the stably expressing clones at the end of the 60 day culture 
period. Filter hybridization analysis of the Hirt DNA indicated that the ES cells carried 
approximately 20 copies of pMGD20 and 5 copies of pHPCAG-/acZ per cell (data not 
shown). The lower copy number of pHPCAG-/acZ may be due to its larger size 
and/or the toxic effect of strong lacZ expression. The Hirt DNA was transformed into 
E. co// for further analysis. Of the bacterial transformants, 20% carried pHPCAG-/acZ 
and the remainder carried pMGDneo20, in good agreement \fcith the hybridization 
data. Restriction mapping showed no evidence of rearrangement in either plasmid 
(Figure 6). 

In the experiment above, ceils were maintained under selection with hygromycin B. 
In the absence of selection pressure, supertransfectant clones lost expression of /?- 
gaiactosidase over several passages in culture. This might indicate an intrinsic 
instability of supertransfected episomal vectors. However, it could also reflect a 
selective disadvantage for ES cells which express high levels of /?-ga!actosidase. It 
is noteworthy in this regard that the primary episome, pMGD20neo, is stable in the 
absence of se!ection(8]~7 

Stability of expression from pHPCAG-/acZ during the in vitro differentiation of ES cells 
was also analyzed. Differentiation was induced in three ways: withdrawal of LIF; 
exposure to retinoic acid; and treatment with 3-methoxybenzamide(18) . After 6 days 
the differentiated progeny stained ubiquitously in all three cases (data not shown). 

These data indicate that supertransfected episomal vectors can be maintained in an 
extrachromosomal state and direct strong expression of transgenes during long-term 
self-renewal and differentiation in vitro. 



WO 98/32868 PCT/GB98/00216 

Production and secretion of the cytokine LIF from an episomal ES cell 
expression vector. 

The pHPCAG-/acZ piasmid can efficiently direct strong and homogeneous expression 
of the cytoplasmic lacZ reporter gene. We next investigated expression of a secreted 
molecule, the cytokine LIF. LIF is an essential supplement to ES cell culture medium 
because it inhibits differentiation of the stem cells (19,20) . Expression of LIF can 
readily be assayed by formation of stem cell colonies in media lacking the cytokine. 

Episomal vectors for expression of another cytokine, interleukin-2 (which has no 
effect on ES cell phenotype), and for LIF were eiectroporated in parallel into MG1.19 
cells. The cells were seeded at low density (1 .5x1 0 4 and 5x1 0 3 cells per 90mm plate) 
to avoid the rescue effect which arises from the production of LIF by differentiated ES 
cell progeny (21) , and cultured with 80 //g/ml of hygromycin B for 8 days. pHPCAG- 
112 generated large numbers of stem ceil colonies in medium supplemented with LIF, 
but none in the absence of LIF. pHPCAG-//f in contrast produced comparable 
numbers of healthy stem cell colonies in both the presence and absence of 
exogenous LIF (Table 2). These colonies could be expanded and propagated without 
LIF-supplementation of the medium. These data confirm previous observations that 
increased autocrine expression of LIF renders ES cells factor-independent (22) and 
establish that secreted proteins are produced efficiently and stably by this episomal 
expression system. 

Co-supertransfection of episomal vectors. 

Introduction of two or more different transgenes into ceils is often required for 
analysis 'of protein interactions and/or co-operative function. The poor efficiency of 
homogeneous expression in conventional transfectants is a major obstacle for such 
investigations in ES cells. To test the possibility that the episomal approach could 
be applied to co-express multiple cDNAs, we constructed episomal expression 
vectors with different selection markers. Co-supertransfection of episomal vectors 
was then assessed. 



he basic episomal expression vector pHPCAG carries the hygromycin 
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phosphotransferase gene driven by mouse PGK-1 promoter (PGKhphpA). We 
prepared episoma! vectors which carry the zeocin-resistance gene driven by the 
human cytomegalovirus immediate-early promoter (pZPCAG), or the biasticidin S- 
resistance gene driven by the SV40 enhancer/promoter (pBPCAG) by substitution 
of the PGKhphpA cassette in pHPCAG. These vectors were supertransfected into 
MG1.19 cells followed by S days selection with the appropriate antibiotic. 
Comparison of the numbers of resulting drug-resistant colonies (Table 3) revealed 
that these selection systems are slightly less efficient than hygromycin B selection but 
nonetheless enable large numbers of supertransfectants to be isolated. 



ES cells harbouring two different episoma! vectors can be isolated by repeated 
supertransfection. Supertransfectants carrying pHPCAG can be transfected again 
with p3PCAG or pZPCAG, with comparable efficiency to the original supertransfection 
into MG1.19 ES cells (data not shown). This should allow establishment of efficient 
screens for assaying functional interactions between gene products. 

The effects of co-electoporation of supertransfection vectors were also investigated. 
pHPCAG (10 fig) and pBPCAG (10 fig) were co-eiectroporated into 5x10 6 MG1.19 
cells. Cells were selected in hygromycin B or biasticidin S only, or both, for 8 days 
and the number of drug-resistant colonies scored in each case. The numbers of 
hygromycin or biasticidin S single-resistant colonies were 39,000 and 13,000, 
respectively, while the number of double-resistant colonies was 1,200. Thus the 
apparent efficiency of incorporation of both plasmids was less than 10%. Similar 
results were obtained on co-supertransfection of pHPCAG and pZPCAG (not shown). 
These data suggest that the majority of supertransfectants incorporate only one 
plasmid under these electroporation conditions. This is significant for application of 
the episomal system to functional cDNA library screening. 

EXAMPLE 2 

i he effects of overexpression of a large number of transgenes in ES cells were 
investigated by construction of vectors based on pHPCAG and including a DNA insert 
coding for the transgene being investigated. 5 x 10 5 ES MG1.19 cells were 
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supenransfected with 20 //g of expression vectors and selected with 80 //g/ml of 
hygromycin 3 for 8 days. The numbers c: drug-resistant colonies were counted and 
normalised relative to numbers obtained with empty vector. The results are shown in 
Table 4. 

EXAMPLE 3 

Inhibition of ST AT3 activation blocks self-renewal and promotes differentiation 

To assess directly the requirement for STAT3 activation in ES cell self-renewal, we 
exploited a dominant interfering mutant form of STAT3, STAT3F. In this mutant 
(Minami et a!., 1995), the tyrosine residue at amino acid position 705 is mutated to 
phenylalanine. Phosphorylation of Tyr705 is required for dimerizaiion and nuclear 
translocation. When expressed at high level, STAT3F has bees shown to block the 
activation of endogenous STAT3 in various cell types, possibly by titrating out 
receptor docking sites (Fukada er a/., 19S6; Minami et a/., 1996; Nakajima et a/., 
1996; Bonni et a/., 1997; Ihara er a/., 1997). 

Using conventional transfection approaches we were unable to recover ES cell 
transfectants showing stable high level expression of STAT3F. In parallel 
experiments, however, transfection of the LIF-independent embryonal carcinoma cell 
line P1 9 yielded multiple expressing clones. This suggested that blockade of STAT3 
activation in ES cells specifically resulted in cell death, growth arrest or differentiation. 
The transfection and expression strategy of the invention was therefore adopted to 
enable characterisation of the consequences of STAT3F expression. 

The STAT3F mutant cDNA was introduced into the supertransfection vector pHPCAG. 
The wild type STAT3 coding sequence was also introduced, in both sense and 
antisense orientations. The three constructs were electroporated into MG1.19 cells 
which harbour a large T expression plasmid and can be supertransfecied with 
constructs containing the polyoma origin (Gassmann et a/., 1 995). Supertransfectants 
were isolated by selection in hygromycin B for 8 days in the presence of LIF. 
Colonies were fixed, stained with Leishman's reagent, counted, and scored for the 
presence of stem cells and differentiated cells. More than 95% of colonies obtained 
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following supertransfecticn with ccnirci or wild type STAT3 vector were stem cell 
colonies (Figure 7A). A modest increase in the proportion of differentiated colonies 
was obtained with the antisense construct. The STAT3F vector, however, yielded 
predominantly differentiated colonies. A decrease in total number of colonies was also 
observed after supertransfection with STAT3F. This may reflect an early onset of 
differentiation which would produce very small clones that would not be scored. 
Alternatively, very high levels of STATS F expression may also be toxic, though this 
has not been reported in other cell types. Morphologically, the differentiated STAT3F 
colonies closely resembled the differentiated colonies generated on culture of ES 
cells in the absence of LIF (Figure 7C). Various other cDNAs have been expressed 
in ES cells using this system, with little or no effect on differentiation (data not 
shown). This suggested that the effect on differentiation was specifically attributable 
to expression of STAT3F. 

The differentiation induced by expression of STAT3F was examined further by 
expression analysis of the marker genes rax1 and H19. Rex-1 mRNA, which is 
specifically expressed in undifferentiated stem cells, was down regulated in STAT3F 
supertransfectants. In contrast, H19 RNA which is found at low levels in stem cells 
but is unregulated during differentiation, was increased (Figure 73). A similar pattern 
of gene regulation is observed during differentiation of ES cells induced by withdrawal 
of LIF. These data confirm that the morphological differentiation triggered by STAT3F 
is accompanied by reprogramming of gene expression. 

STAT3F was also expressed from the mouse phosphoglycerate kinase (pgk-1) 
promoter in the episomal vector ph'PPGK. This vector gives at least 10-fold lower 
expression than pHPCAG (data not shown), in this case, there was no significant 
effect on either colony number or differentiation status of MG1 .1 9 supertransfectants. 
A critical level of expression of the dominant interfering mutant therefore appears 
necessary to block self-renewal. 
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Effect of STAT 3F on self-renewal is suppressed by co-expression of STAT3 

To test whether the induction of differentiation by expression of STAT3F was due to 
an inhibition of endogenous STAT3 activity, we attempted to rescue the stem cell 
phenoiype by co-expression of wild type STAT3 and aiso of STAT1 and STAT4. A 
S7AT3F expression vector carrying a biasticidin resistance marker was co- 
supenransfected into MG1.19 cells with episomal constructs for expression of wild 
type STATs and hygromycin resistance. Co-supertransfectants were isolated in 
medium containing both 20>g/ml of biasticidin S and 80//g/ml of hygromycin B. The 
numbers of stem cell and differentiated colonies were scored after 8 days. As shown 
in Figure 8, only co-expression of wild type STAT3 restored self-renewal in the 
presence of STAT3F. Transferiion with STAT1 or STAT4 constructs alone had no 
effect on self-renewal in the absence of STAT3F (not shown) and did not alter 
differentiation induced by STAT3F. In the case of supertransfection with the CAG 
promoter STAT1 construct, the total number of colonies (stem + differentiated) 
recovered was reduced but the relative proportion of stem cell colonies versus 
differentiated cells was unaltered. This occurred in both the presence and absence 
of co-expression of STAT3F, and suggests that high level expression of STAT1 may 
be toxic to ES ceils. By using the mouse PGK-1 promoter to drive lower levels of 
expression comparable numbers of colonies were recovered on transfection with the 
STAT1 as with the other constructs, in this case, again only the STAT3 construct 
showed any restoration of stem cell colonies, although to a lower degree than with 
the high expression CAG vector (not shown). These data indicate that STAT3 has a 
specific function in ES cells which cannot be compensated by STAT1 or STAT4. 

EXAMPLE 4 

The invention is also used in a strategy for direct selection of genes that code for 
secreted and cell surface proteins. In one example of this strategy, the basic cloning 
vector is a truncated form of IL6R thai lacks a signal sequence. This vector is 
described in detail below and shown in Fig. 11. If this truncated IL5R is expressed 
in ES cells, it is not exported to the celi surface and these cells differentiate when 
cultured in IL6. However, if the IL5R signal sequence is reconstituted by a signal 
sequence provided by a cDNA fragments cloned in frame at the 5' end of the 
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truncated IL5R, the chimaeric receptor is expressed on the surface of ES cells. ES 
ceils containing such chimaeric receptors are thus maintained as undifferentiated 
colonies when cultured in IL5. 

Libraries of short, 5' cDNA fragments are produced and cloned into a truncated and 
modified IL6R-based expression vector. ES cells transformed with such libraries 
express cDNA;IL6R fusion proteins. However, only cDNAs that encode signal 
sequences confer !L6 responsiveness on ES cells. These cDNAs alone give rise to 
undifferentiated, proliferating ES cell clones. This strategy therefore provides a direct 
selection for cDNAs encoding secreted and cell surface proteins. 

The chimaeric IL6R is expressed in the episomal expression system described above 
(or a derivative thereof). This allows drug selection for episomally transformed ceiis 
and high level expression of cloned DNA. 

To further refine the selection system, ES cells are modified with two targeted 

mutations: 

a) A selectable marker gene, for example the biasticidin resistance gene, is 
introduced into the OCT-4 locus by standard targeting techniques. Since Oct-4 is 
expressed in undifferentiated ES cells, the biasticidin resistance gene will be 
expressed only by undifferentiated colonies. Biasticidin selection therefore is used to 
decrease background growth by ensuring rapid deletion of differentiating, Oct-4 
negative, ES cells. 

b) Since ES cells can produce LIF as an autocrine growth factor, ES cells are used 
in which both copies of the L!FR gene have been disrupted by gene targeting. This 
eliminates the possibility of LIF-dependent, false positive colonies that might 
otherwise persist throughout selection in IL5. 



Details of vector construction: 

1). IL6R was cloned into the episomal vector pCAGIP or a derivative (pCAGIPXN, 
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i.e. pCAGi? with a destroyed Noil site). pCAGIP contains an internal ribosome entry 
site (IRES) and a puromycin resistance gene downstream of its multiple cloning site, 
resulting in stoichiometric production of cDNA:IL6R fusion proteins in transfected cells 
under puromycin selection. IL5R in pCAGIP provides a positive control (IL6- 
responsive functional protein on the cell surface), and the basis of the new vector. 

2) . To construct the cloning vector, IL5R cDNA was truncated by cleavage with 
3ssHII at nucleotide number 92. This deleted the initiator ATG and sequences 

encoding the signal sequence. 

\ 

3) . To minimise potential ste\^c interference by cfoned proteins with IL6 binding and 
ILSR function, DNA encoding a Wnthetic flexible linker peptide was then added to the 
5' end of the truncated IL6R. TWO alternative linkers have been used: gly giy gly giy 
ser gly gly giy gly ser and a Iink^ containing the FLAG epitope, gly ser ASP TYR 
LYS ASP ASP ASP ASP LYS (FLAfe epitope in upper case). The sequence of these 
linkers is shown in Fig. 9. In each Wise, the linker sequence has been cloned in 
frame with IL6R and has two unique cloning sites (Xhol and Notl) at its 5' end, 

allowing the introduction of cDNA libraries, or specific cloned sequences, in a 

... \ 

cirectional manner. The FLAG epitope 1? recognised by a commercially available 
monoclonal antibody (M2; available from l^l/Kodak) regardless of its position within 
a fusion protein, and will thus allow the ex^ession levels of surface protein to be 
measured directly by immunocytochemistry. \ 

4). Vectors containing each of these linkers and an upstream signal sequence are 
tested for relative expression level and ILSR-function, as detailed below. 

To test the utility of these vectors for selecting proteins expressed at the cell surface, 
a number of known signal sequences are cloned into each vector. These are tested 
for surface. expression and 1L6R function. Signal sequences include those from rat 
CD4 (a protein with extracellular Ig domains), mouse sek (a receptor tyrosine kinase, 
with no extracellular Ig domains) and mouse sonic hedgehog (a secreted factor). 
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=S cells are transfected with vectors 



bearing candidate signal sequences by 



lipofection or eiectroporation, followed by puromycin selection for transfected cells. 
After overnight growth in the presence of LiF, to maintain the undifferentiated state 
and proliferation, transfected cells are split into three groups and treated with either 
1) LIF, 2) IL6 or 3) neither growth factor. Only cells bearing IL6R brought to the cell 
surface by a fused signal peptide will proliferate in the presence of IL6. Positive 
controls include ES cells transfected with wild-type IL6R grown in the absence of LIF 
and the presence of IL6. Negative controls include empty vector (i.e truncated IL6R 
with no 5' insert) grown in the presence of IL5. To determine whether fusion proteins 
N-terminal to IL5R block signalling (by steric hindrance), the proportion of such cells 
that express surface protein but fail to proliferate in response to IL5 is deduced by 
comparing the number of cells expressing the FLAG epitope With the number that 
give rise to colonies. 

Vectors defined by this assay are then used in cDNA library screens. Preferably, 
sequences corresponding to 5' ends of cDNAs are generated from full length' cDNA 
libraries and directionaliy cloned in the screening vector. 
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We have thus described the development of an optimised transfection and expression 
system which will enable high throughput functional screening of cDNAs in 
pfuripotential mouse embryonic stem (ES) cells and differentiated derivatives. The 
strategy is based on extrachromosomal vector replication driven by expression of 
polyoma large T protein. When a vector containing a polyoma origin of replication 
is introduced into an ES cell line that harbours polyoma large T antigen, a high 
frequency of stable secondary transfection results. This process is referred to as 
supertransfection. Supertransfected plasmids can be maintained episomally during 
long-term culture and during differentiation in vitro. Expression of a y!?-galactosidase 
reporter from an episomal vector is both ubiquitous and stable, in contrast to the 
variegated and unstable expression usually observed after cDNA integration into the 
ES cell genome. Moreover, in the absence of integration, promoter strength is 
predictable and a range of expression levels can reliably be achieved by using 
different elements. We also show that episomal vectors can be used for efficient 
expression of both cytosolic and secreted proteins. These features should make this 
system invaluable for functional analyses of defined cDNAs and for direct expression 
screening of cDNA pools or libraries in ES cells. 
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Table 1. Comparison of /?-gaiac:osidase activities directed by various promoters in 
transient and stable supertransfectanis. 



Promoter Relative /?-ga! activity 



transient stable 

SV40 e/p 1.0 1.0 

h/?Ap 1.1 0.7 

mPGKp 0.5 0.5 

TKp 0.1 0.1 

CAG 19.0 1.8 



5x1 0 3 MG1.19 ES cells~were supertransfected with 20//g of vector DNAs. After 3 
days culture for transient expression assay or 8 days selection with hygromycin B for 
stable expression assay, the /?-galactosidase activity generated by these constructs 
was measured by ONPG assay. Results are normalised relative to activity generated 
by the SV40e/p construct. See 'Materials and methods' for construction details of 
vectors. 
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Table 2. Supertransfection of LIF and IL-2 expression vectors into MG1.19 
ES cells. 



Vector LIF in medium No. of hyg' stem cell colonies 

pHPCAG-//f + 42,000 

pHPCAG-//f - 33,000 - 

pHPCAG-//2 + 48,000 

pHPCAG-/72 " - 0 

5x10° MG1.19 ES cells were supertransfected with 20^/g of vector DNAs. After 8 
days selection with 80//g/ml of hygromycin B in the presence or absence of LIF, the 
number of stem cell colonies were scored. 
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Table 3. efficiency of supertransfeciion of vectors with various selection markers. 



Selection marker Drug for selection (//g/ml) No. of resistant colonies 



PGKhphpA hygromycin 3 (80) 50,000 

SV6srpA blasticidin S (4) 12,500 

hCMVzeopA zeocin (20) 20,500 



5x10° MG1.19 ES cells were supertransfected with 20//g of vector DNAs of episomal 
vectors, pBPCAG and pZPCAG, which carry bsr and zeo resistance genes 
respectively. After 8 days selection with the appropriate drug, the number of drug- 
resistant stem cell colonies were scored. 
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Table 4. Effects of overexpression of transgenes in ES cells using pHPCAG. 



cDNA Relative number of Colony Size and 

hygro R colonies Morphology 



None 




1 .00 


Normal 


lacZ 




0.54 


<^ m a f i 

Of t 1 Cl 1 


DIA/LIF 




0 87 


ouy t my bmaii 


IL-2 




0.92 


c;fir , n1'l\/ email 
ouyiiuy oil la,! I 


Rex-1 




0.88 


Normal 

t N W till Q I 


Fgf-2 






M o r*m o I 
IM U 1 i II dl 


Faf-4 
i yi 






iMormai 


Faf-5 




n ai 


Normal 


Oct-1 




n 1 7 

W.I/ 


small 


Oct-2 




0 S5 


QllnHtlw email 

oiiy i uiy oil tail 


Oct-3/4 




0 61 


UU izJruii UalcQ 


Oct-6 




0 03 


oUlllc; U II I til cl UidllOn 


c-jun 




0 47 


c m a 1 1 

Oil i ctl 1 


E1 A 




0.08 


riiff rironti^tpH 

Ull 1 ^ 1 O t 1 UGICU 


Jak2 K/E 




0 7^ 


In urmai 


bcl-2 




0.28 


small, spindle morphology 


MAPKP 




1.38 


Normal 


RXRa 




0.20 


some differentiation 


RXRp 




0.63 


Normal 


RXRy 




0.91 


Normal 


COUP-TF1 




0.40 


some differentiation 


HNF-4 




0.05 


Normal 


Statl 




0.10 


small 


Stat3 




0.52 


Normal 


Stat4 




0.16 


Normal 


Stat3DON* 




0.14 


differentiated 



:10° ES MG1.19 cells were supertransfecied with 20 ug of expression vectors and selected 
th 80 ug/ml of hygromycin B for 8 days. The numbers of drug-resistant colonies were 
'unted and normalised relative to numbers obtained with empty vector. 
tat3DON is the dominant interfering mutant form of Stat3 described by Akira et ai (1995). 



